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Abstract 

For a jointly measurable probability -preserving action r : MP r\ (X, /i) 
and a tuple of polynomial maps p; : M — > R D , i = 1, 2, . . . , k, the multiple 
ergodic averages 

~ A/i ° r pi(t) )(/ 2 o r P2 «) • • • (f k o t**W) dt 
J Jo 

converge in £ 2 (/i) as T — > oo for any fi, f2, ■ ■ • , fk S L°°(n). This con- 
firms the continuous-time analog of the conjectured norm convergence of 
discrete polynomial multiple ergodic averages, which in is its original for- 
mulation remains open in most cases. A proof of convergence can be given 
based on the idea of passing up to a sated extension of (X, //, r) in order 
to find a simple partially characteristic factor, similarly to the recent devel- 
opment of this idea for the study of related discrete-time averages, together 
with a new inductive scheme on tuples of polynomials. The new induction 
scheme becomes available upon changing the time variable in the above in- 
tegral by some fractional power, and provides an alternative to Bergelson's 
PET induction, which has been the mainstay of positive results in this area in 
the past. 
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1 Introduction 



Given commuting probability -preserving transformations T\, T2, . . . , T& rx (X, p), 
the study of the associated 'multiple' (or 'diagonal') ergodic averages 

1 N 

-^(/ 1 oTr)(/ 2 oT 2 ")...(AoTD 

n=l 

by now has an extensive history. Interest in them originated in Furstenberg's proof 
in |fl6Tl of the Multiple Recurrence Theorem, which effectively shows that the in- 
tegrals of the above averages stay uniformly positive as N — > 00 when Tj = T l 
for some fixed T and fa = fa = . . . = f^ = 1a for some non-negligible set 
A. That work was followed by a multidimensional generalization in Furstenberg 
and Katznelson's paper [17], and since then the above averages, various general- 
izations of them and a host of related questions have been investigated by several 
researchers: see, in particular, the papers |Q2] [13] [B], 10, ll28l . ll30l . lfl8l . 00, 
HIDEO, ED, ED and (291. 

In a sequence of breakthroughs culminating in Tao's work, it has now been shown 
that the above averages always converge in L 2 (/j,) as iV — > 00, augmenting our un- 
derstanding of their asymptotic nonvanishing due to Furstenberg and Katznelson. 
However, their generalization to 'polynomial' multiple averages, such as 

i N 

± o If W)(/a o 2f (n) ) • • • (A o 

n=l 

for pi,p2j • • • £ Z[-X], remain far less well understood. In case Ti = T2 = 
T 3 = . . . = Tfc their convergence and also a reasonably good description of their 
limit have been obtained in ll26l 1211 . and similarly in the case of more general 
commuting transformations under some assumptions of weak mixing [5] or of 
some algebraic independence among the polynomials iTTTi . A handful of other 
higher-dimensional cases are now at least partly understood ifTOl |2l |3l . However, 
the following broader conjecture remains open in general: 

Conjecture 1.1 For any probability-preserving action T : lP rx (X, p) and any 
polynomial mappings pi, P2, . . . , p& : Z — > iP the averages 

1 N 

- J2(fx o T^)(fa o T^W) • • • (f k o 

n=l 

converge in L 2 (fi)for any fx, fa, ■ ■ ■ , fk G 
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A version of this appears as Question 9 in Bergelson [6|, although it had certainly 
been proposed informally before that, by Furstenberg and others. In addition to 
the special cases mentioned above, it has been proved for general polynomial map- 
pings under the assumption that the action T is totally ergodic by Johnson in 11231 . 
In (8l, Bergelson and Leibman extend this conjecture even further to the setting 
of actions T rx (X, p) of a discrete nilpotent group T and 'polynomial mappings' 
Pi : 7L — > T; here we will omit the technical preliminaries needed to set up this 
latter notion. 

This conjecture has been verified in all special cases that have been successfully 
analyzed to date; but the partial results obtained in indicate that the general 
conjecture may still lie some way beyond the scope of current approaches to these 
results. However, in this paper we will see that the 'continuous-time' analogs of the 
above averages are rather simpler to understand, and do indeed enjoy the analogous 
convergence. 

Theorem 1.2 For any jointly measurable action (X, p) and any tuple 

of polynomials pi, p2, . . . , pt '■ R — > R D the associated multiple ergodic averages 



A T (f u f 2 , . . . , f k ) ■= 




•Pi(t) dt 



converge in L 2 {p) as T — > oo for any f\, f 2 , . .., fk G L°°(/x), where fj denotes 
the average jjj Jjfor any bounded real interval I. 

By analogy with Bergelson and Leibman [8], it should be possible to formulate an 
extension of the above conclusion that applies to a tuple of polynomial maps p^ : 
E — > N taking values in some nilpotent Lie group N and an action N r> (X, /j.) ; 
however, this seems to lead to new algebraic complications that we will not try to 
surmount in this paper. 

Theorem [L2] will be proved by induction on the set of polynomial maps {pi,P2, • • • ,Pfc}- 
An important invention from is Bergelson's 'Polynomial Ergodic Theorem' 
('PET') induction scheme: a wellordering on such sets of polynomials that he and 
many others have now used as the basis for analyzing multiple polynomial ergodic 
averages. However, in this paper we depart from this scheme, using instead a 
simple but crucial change in the time-variable that allows us to work instead with 
families of 'fractional polynomial maps', among which a different and rather more 
efficient induction scheme becomes more natural. In a sense, the availability of 
these fractional polynomial maps in the continuous-time setting indicates a much 
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greater 'smoothness' that is enjoyed by the averages of Theorem 1 1.21 than by their 
discrete-time analogs, hence leading to simplified behaviour. This flexibility in 
choosing time-changes was also shown to me by Vitaly Bergelson. 

In Section [2] we will introduce and justify these changes of variables, and reformu- 
late Theorem ll.2l accordingly. Then, in Section 3, we introduce an extension of that 
reformulated result which will be needed in order that our induction close on itself, 
and then we introduce the partial ordering on families of (fractional) polynomials 
that we will use and complete the proof. In addition to the time-change and this 
partial order, the main tool we need is the technology of sated extensions devel- 
oped in |2], which in our setting enables us to extend an initially-given M^-system 
to one in which the averages of interest exhibit simplified behaviour from which 
their convergence can be deduced. 

Before launching into technical details, we should mention two other recent works 
concerning multiple averages in continuous-time. The first, that of Potts [27], stud- 
ies the averages 



for a single flow r : E r\ (X, /i). She shows that for these averages the pow- 
erful machinery developed by Host and Kra (see [22], and also ED) on multiple 
averages for a single transformation has a direct counterpart (and in fact that the 
Host-Kra-like factors of the system (X, /i, r) that underly the resulting analysis co- 
incide with those for the single transformation r*° for a suitable choice of time-step 
to > 0). As a result Potts is able to obtain convergence and multiple recurrence 
for the above continuous averages, as well as various finer results describing their 
limit. 

A different approach to the continuous-time setting has also recently been inves- 
tigated by Bergelson, Leibman and Moreira 0. Their work is based on a direct 
reduction to analogous questions for discrete transformations, for which the de- 
sired convergence is then assumed as a black-box result, rather than showing how 
discrete-action machinery can be reconstructed in the continuous setting. This re- 
duction, in turn, is based on a very general result asserting that convergence to a 
common limit for a family of discrete Cesaro averages implies the similar conver- 
gence for an enveloping continuous Cesaro average (Theorem 0. 1 in their preprint). 
Their method gives results for the above one-parameter linear multiple averages, 




i=l 
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and also for polynomial multiple averages such as 

r-T k 

7 Wifi ° TPlit) ) At for polynomials Pi : R -> M. 

^ 4=1 

However, neither of the above approaches is currently sufficient to prove conver- 
gence for general polynomial averages in an action of W D for D > 2, because in 
both cases the authors make heavy appeal to results for the discrete-time setting 
that are already known: either by showing how structural results from that setting 
have analogs for continuous-time flows, or by simply reducing the desired conver- 
gence assertion to known facts about discrete transformations. By contrast, one 
of the surprising features of Theorem 11 .2 1 is that the analysis of continuous-time 
polynomial multiple averages in higher-dimensional actions is actually much sim- 
pler than its discrete-time relative, so that we are able to prove the theorem above 
while the behaviour of the analogous discrete-time averages remains largely mys- 
terious. Our proof uses the machinery of sated extensions, previously employed 
in (HO and related to earlier arguments from [4| and [19 ] , but which has not yet 
provided enough insight to handle the general case of Conjecture 11.11 (We should 
also remark that another consequence of using extensions in proofs of convergence 
is that relatively little can then be deduced about the structure of the limit, whereas 
Potts and Bergelson, Leibman and Moreira do obtain some such descriptive re- 
sults as well.) The greater simplicity in the continuous-time setting results from 
the greater 'smoothness' of the continuous averages alluded to above, which will 
be exploited concretely through the use of the fractional-power time-changes that 
were mentioned there. 

Acknowledgements 
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the Clay Mathematics Institute. 

2 Formulating the right question 

It turns out that one of the chief obstacles to proving Theorem 11.21 is finding the 
right conjecture: the assertion of Theorem IL~2~1 is actually too weak. Instead we will 
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consider functions fa (rather than polynomials that are vector-valued sums of 
maps with powers between and 1. This will be made simpler by also allowing 
some greater flexibility in the location of the intervals over which we take our 
ergodic averages. 



Definition 2.1 (Tempered sequence; tempered-uniform convergence) A sequence 
of bounded open intervals I n C R is tempered if\I n \ oo and there is some fixed 
K > such that dist(0, I n ) < K ■ \I n \for all n>\. 

Given now a locally integrable map v : [0, oo) — > fj into some Hilbert space, 
the averages of v over bounded intervals in R converge tempered-uniformly if 
the sequence of averages jj-^ Jj v(t) dt converges as n — > oo for any tempered 
sequence of intervals I n C [0, oo). 



In fact, tempered-uniform convergence is not really a new property of such a map 
v. Indeed, one has 




for any real a n < b n , and if the sequence of intervals (a n , b n ) is tempered then 
the coefficients b n j (b n — a n ) and a n / (b n — a n ) that appear here remain bounded 
with difference equal to 1, so the tempered-uniform convergence of the averages 
fj v(t) dt is actually equivalent to the Cesaro convergence of v. (This also clearly 
implies that the limit is the same along any tempered-uniform sequence of inter- 
vals.) The usefulness of the above definition is in providing a convenient handle for 
this convergence along sequences of intervals that may not be pinned to the origin. 



Lemma 2.2 Suppose that v : [0, oo) — ¥ S) is a norm-continuous and bounded 
function taking values in some Hilbert space. Then the tempered-uniform conver- 
gence of the averages 

jv{t)dt 

is equivalent to that of the averages 

jv{s a )ds 
for any a > 0, and the limits are the same. 
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Proof Let I = (a, b). By symmetry it suffices to prove the forward implication 
and show that the limits are the same. This is achieved by making the substitution 
t := s a in the second integral to obtain 

r b i r b a i 

4 v(s a )ds = - / v{t)-t l ' a - 1 dt. 

J a b-aj aa a 

We will now show how this can always be written as a weighted integral of averages 
of the form Aj(v) := fjv(t)dt in such a way that we can apply the tempered- 
uniform convergence of these latter. Let us write A^v) for their tempered uniform 
limit. Since the case a = 1 is trivial, this argument falls into two remaining cases. 

Case 1: < a < 1 In fact let us assume further that a / since this irri- 
tating case can be treated by exactly the same method except that the antiderivative 
of the function t i-> t x l a ~ 2 cannot be written using the usual formula that is valid 
for other a. 

We first compute that 

i r ba i lfn"Wa-i r b " 
_L_ / v{t) ± t i/*-i dt = / v{t)dt 

b — a J a a a b — a J a a 



b-a J aa a y 



-Ar aaba ){v) 
a(b — a) ^ ' ' 

cb a 



+ i r i/a-i tl/a _ 2 r v[u)dudt 

b - a J aa a J t 

a l-a(pa - a a ) 

a(b-a) A ^)^ 

^ f (b^-t^-'A^^dt. 

~ a ) Ja a 



1 — a 



a 2 {b 

Now suppose that I n = (a n , b n ) C [0, oo) is a tempered sequence of intervals, say 
with a n < K(b n — a n ) for all n. Define K n G (0, 1] by b n — a n =: K n b n , so 
that a re-arrangement of the temperedness inequality gives K n > and now a 
simple computation gives 



-^-(1 - (1 - K n f~ a ) - aK n = f 1 (1 - i)t 1/a 



2 dt, 



7 



which is uniformly positive over the possible values K n <G [1/(1 + K), 1]. There- 
fore for any e > we can select some L > K (typically much larger than K) such 
that 

^((r^r-* 1 -^--)-^-^) 

> (1 - e)(-^-(l - (1 - K n ) l ~ a ) - aK n ) 
\ 1 — a J 

for any K n <G [1/(1 + K), 1]. Defining c n := (1 - j^)6 n > we now have a n < 
c n < b n , but on the other hand c n = L{b n — c n ), so any sequence of intervals of the 
form (c' n , b n ) or ((c' n ) a , 6°) for some selection of c' n G [a n , c n ] is still tempered. 

Using the above inequality, another simple calculation gives 

i 

"(b^-t)^- 2 dt 



f 



(V 



! («) 1/Q -«) 1/a ) 



1/a 



(cl~ a - a 1 -*) ~ ^(c n - a r . 



(l/a-l) vn n 7 1/a" 

^(iM^r-™ 1 -")-^--!^)) 

> (1 - e )6 n (-^-(l - (1 - iQ 1 ^) - a^) 
V 1 — a / 

= (l- £ )( IT7 ^(^-a^)- T ^^-a.)) 

= (l-e) [""(b^-t^^dt. 

Combining this with the change-of-variables made above and using that v is a 
bounded function we can write 



a i^a n — a n j j a a 



where H-R1II2 < ^IMIl^Qu.oo)-^)' Now as n — > 00, any sequence of intervals I' n 
such that l' n = (t, 6°) for some a" < t < c° for each n also satisfies dist(0, i^) < 
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K'\I' n \ for some K' > 0. Hence the tempered-uniform convergence of the averages 
Ai{y) to some limiting vector, say A^iy), implies that we must actually have 

\\ A (t,b%)(.v) < e||u||i«>([o, o)->a) v * G K><£) 

for all sufficiently large n. Inserting this approximation into the above average, we 
deduce that for all sufficiently large n we have 

A—afua „a\ 



4 v(s a )ds = 

Ja n a{b n - a n ) 

/ {b«-t)t 1 / a - 2 A 00 (v)dt + R 2 + R 1 

J a* 



1 — a 
a 2 (b n - a, 

«n""(&n-<) 
a(b n - a n ) 



Aoo{v) 



+ a 2 (b r , 



\ - r / 6 "(6° - t)t 1/a - 2 ^oo(^) dt + R 3 + R 2 + R 1 

b n — (In) J a % 

= A^v) + R 3 + R 2 + Ri 
with H-R2II, H-R3II < £ IMlL°°([o,oo)->-i5)- Since e was arbitrary we conclude that 

-j- v(s a ) ds ^ Aoo(v) as n — > 00, 

as required. 

Case 2: a > 1 This is similar, except now we start with the computation 

— — / vti)-^"- 1 dt = ^-^ / «(*)dt 

b-a J aa a b-a J aa 

+- / v(t)-(t l/a - 1 - (b a ) l/a ~ l ) dt 

b-a J aa a y 



1 a 

A(a«,b«){V) 



a(b — a) 

fb' 

-f 



^(o«,6«)(^) 
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a(b — a) 

+ ^)r <t -°" )ti/ ""- 4 <-" ( "> d4 - 
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Just as before, given a tempered sequence of intervals (a n , b n ) we can approximate 
the above expressions arbitrarily well by truncating the second integrals so that they 
involve only ^4( a ^,t) f° r < t < 6" for some suitable sequence a n < c n < b n , 
and then argue that all of the expressions -A( «,t) f° r sucn t converge uniformly 
fast to Aootv), so that the left-hand integrals must do the same. Note that in this 
case, if the values a n actually tend to 0, then this argument also requires the fact 
that 1/a — 1 > — 1 and so the function t i-> t l l a ~ l is locally integrable at 0: this 
is needed so that the small range (a", c"] that we initially omit from the second 
integral on the right-hand side can be chosen so as to give an arbitrarily small 
contribution overall. □ 

Now let us examine a little the taxonomy of the functions that result from applying 
such fractional-power time-changes to polynomials. 



Definition 2.3 (Fractional polynomial; height; degree; goodness) If d € N then 

a map 4> : [0, oo) — > M. D is a fractional polynomial ( f -polynomial' ) of height d if 

it takes the form 

*(t) = Xy/S- 

3=1 

for some tuple of vectors vi, V2, . . . , v<j € M D . Note that there may be more than 
one possible choice of the height d for a given map <p, and in the following we 
will need to keep track of the height as well. We allow the possibility that = 0, 
and let the degree of <p, denoted by deg0, be the largest fraction j/dfor which 
Vj 7^ 0; we also say that <p is of top-degree if deg <p = d/d = 1. 

A height-d f-polynomial is good if the list of vectors vi, V2, Vd-dcgcp is 
linearly independent (including the assertion that they are all non-zero). Note that 
this property depends on the choice of height: for example, given <\> as above, we 
may always re-write it as a sum Yl'j'Li t^ 2d w'- in which \'- = Ofor all odd j. 

Also, we set 

V(4>) := span{vi, v 2 , . . . , v^}, 
which agrees with the linear span of the image <f>(M) C M. d , and we set 

0"(t) : ^V'S 

3=1 

(so <fi~ = c() when <p is not of top-degree). 
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Definition 2.4 (Good families of f-polynomials) A family <f> = {0 X , (f> 2 , . . . , (f> k } 
off-polynomials, say expressed as 

d 

<t>i = J2 ti/d Vi,j M i = 1,2,... , d, 

is good if each of the individual f-polynomials <^> i is good, and moreover all of the 
non-zero vectors Vjj appearing in the above expressions are linearly independent. 

We can now give the simple reformulation of Theorem 1 1.2 1 in terms of fractional 
polynomials: 

Theorem 2.5 For any action r : W D (X, p), any good family off-polynomials 
4>i, 02' ■ ■ ■ ' 4>k '• [0> °°) ~~ an d an y tempered sequence of intervals I n C 
[0, oo) the associated averages 

A n (f 1 J 2 ,...,f k ) :=/ H(fiOT*M)dt 

Jin i = 1 

converge in L 2 (fi) as n — > oo for any f\, f% ■ ■ ■ , fk £ 00 (/•*)• 

Proof of Theorem 11.21 from Theorem 12.51 This relies on a simple changed of 
variables. Suppose that T : R D r% (X, p), that 

d 

Pi(*) = ^2 tJu i,j fori = 1,2,..., A;, 

and let «(t) := nf=i(/i ° T Pi ®), so this is a norm-continuous and bounded map 
into L 2 (/i). 

We first reduce the case in which all the u^ j are non-zero and linearly independent. 
Introduce a formal collection of vectors Vj j for i < fc and j < d that are all non- 
zero and linearly independent. Let M D be their linear span, and define a linear 
map A : R D ' -> M D by setting 

(so there may be linear dependences among these images, and some of them may 
be zero). Now define V : R D ' r\- (X, p) by (T') v := T Av and also 

d 

p'i(t) = i J Vjj for each i = 1, 2, . . . , k, 

3=1 
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and observe that T Pi ^ = (T') p ^*). Hence we can write our multiple averages 
in terms of T' and the p[, and so it suffices to prove Theorem 11.21 in case the 
coefficient vectors uy are all non-zero and linearly independent. 

Assuming this, define 

d 

0i(fl) := JV'Vj fori = 1,2,..., d, 

i=l 

and observe that this is a good family. On the other hand, by Lemma 12.21 the 
norm convergence of the averages v(t) dt follows from the tempered-uniform 
convergence of the averages 



k 

I V (s 1 / d )ds = if[(f i OT^)d t 
J I Jl l=1 



This completes the proof. □ 



3 The full induction 

3.1 Furstenberg self -joinings and partially characteristic factors 

Here we reformulate in our present setting some older machinery from the study 
of multiple ergodic averages. 

First, suppose that we have already established convergence for some family {cf> 1 , <f> 2 , 
of f-polynomials [0, oo) — > M. D , that r : M. D rx (X, y) is a system and that Ao, A\, 
A 2 , . . . , Aft are Borel subsets of X. Then our assumption implies that the scalar 
averages 

/ n(A n r-^ i(i) (^i) n • • • n r-** ( *>0A fc )) dt 
Jo 

converge as T — > oo. Denoting the limiting value by ^ (Aq x A\ x 
■ ■ ■ x Ak), it is now easy to check that this may be extended by multilinearity and 
continuity, and that this actually defines a (A; + l)-fold self-joining fj^ ^ of 
the system (X, fj,, r) (which depends also on r, although we suppress this in our 
notation). This construct has its origins in Furstenberg's original work on multiple 
recurrence lTl6l . and is referred to as the Furstenberg self-joining associated to 
the family {<p 1 , <f) 2 , . . . , 4>k}- This will prove an important tool for analyzing our 
averages, much as in the previous works in 11121 |3T1 14] |Z|. 
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Next, a factor £ : (X, fj,, r) — > (Y, z/, a) is partially characteristic for a given 
tuple of f-polynomials <f>^, (f> 2 , . . ., (f) k if 



for any /i, f2, fk € I 00 ^) and any tempered sequence of intervals C 
[0, oo), where we write F n ~ G n to denote that — G n \\2 — > as N — > oo. 
This notion is based on a definition that first appears in Furstenberg and Weiss' 
paper [18], and which has gone through a number of incarnations since. Note that 
it involves operating only on the last function f\ in the list. 

As in most previous proofs of convergence for some family of multiple ergodic 
averages, the heart of our induction will be finding a partially characteristic factor 
that has some additional structure allowing averages of interest to be re-written 
into a simpler form. Here we will also make crucial use of a more recent twist 
on this strategy, in which an initially-given system must first be extended (that is, 
expressed as a factor of some 'larger' system) before the desired factor can be 
shown to be partially characteristic. This approach has been developed in Hill El 
3]. Here we will appeal to the very general machinery of sated extensions from J2] 
in order to make this initial construction of an enlarged system. 

3.2 Satedness 

Following O, a class C of standard Borel probability-preserving M D -systems is 
sated to be idempotent if it is closed under isomorphisms, inverse limits and (not 
necessarily ergodic) joinings. These conditions are enough to guarantee that any 
such system (X, /i, r) has a maximal factor Q. : (X, [i, r) — > C(X, fJL, r) whose 
target system is a member of the class C. In these terms, a system is C-sated if 
whenever n : (X, fx, f) — > (X, fi, r) is an extension, the factors it and are 
relatively independent over the further common factor £J o 7r. More concretely, the 
C-satedness of (X, /i, r) means that if / € L 2 (fi) and we prove that 



Here we will be concerned with idempotent classes of the following kinds. If V < 
M D is a vector subspace then we may associate to it the class Zq of IR^-systems 




k 



k-l 
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whose y-subaction is trivial; and now given several subspaces V\, V2, ■ ■ ■ , Vg < 
R D we let Z^ 1 V Z^ 2 V • • • V Z e be the class of all joinings of systems drawn 
from each of the classes Zj\ Both of these examples are readily verified to be 
idempotent (or see Section 3 in [2 ]), and if we abbreviate (l v =: Cv tnen a simple 
check also shows that 

1 

1=1 

as factor maps of (X,fx, r), in the sense that these maps define the same factor 
of t up to negligible sets, and where the right-hand side denotes the factor map 
generated by the individual factor maps . 

In these terms, we now make the following definition, which closely follows the 
idea of a 'fully isotropy-sated' system from J2]. 

Definition 3.1 (Fully rationally sated systems) We will write that (X, //, r) is fully 
rationally sated ('FRS') if it is sated for the idempotent classes 

vev 

whenever V is a finite collection of rational subspaces of MP, where a subspace 
V < MP is rational if it has a basis consisting of members ofQ D . 

The usefulness of this definition derives from the following general fact, which is 
an immediate special case of Theorem 3.1 1 in J2j: 

Theorem 3.2 (Sated extensions exist) Every M. d -system has an extension that is 
FRS. □ 

The principal limitation of Theorem 3.11 in |f2] is that it can be applied to at most 
countably many families of idempotent classes, and this is why we must restrict 
our attention to rational subspaces in Definition 13.11 



3.3 An ordering on fractional polynomial families 

The proof of Theorem 12 . 5 I will require an induction on good families of f-polynomials 
4> , and so we make a separate step of introducing the relevant ordering on these 
families, and introducing two particular kinds of 'downward movement' through 
the collection of such families that will appear during the induction. 
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Definition 3.3 (The precedence ordering) Given two non-empty good families of 
height- d fractional polynomials, say <f) = {<t>\, 02> ■ ■ ■ > <Pk} an d W = {^1^2^ ■ ■ ■ > 
we say that ip precedes <p, written tp -< <p, if 

• £ < k, 

• when the (p i and ip i are ordered so that their degrees are non-increasing in 
i, one has 

deg tp i < deg i Vi < -£ 
(so f/ie (pifor I < i < k are not needed here), and 

• either £ < k, or if I = k then strict inequality holds in the second condition 
above for some i < £. 

In addition, we always have ip -< ip and <p are families of fractional poly- 
nomials of distinct heights d and d' > d. 



It is clear that this defines a partial order on the collection of all families of height-d 
fractional polynomials, and that it satisfies the descending chain condition. Note 
that the inequality £ < k is not by itself enough to guarantee that tp -< <p : it is 
also necessary that the former family not have too many high-degree members, in 
the sense of the second point above. 

Suppose now that = {<p 1 ,4> 2 , . . . , 4> k } is a good family with k > 2. Two 
special kinds of precedent for <p will be important in the sequel: 



• On the one hand, suppose that j\ G {1, 2, . . . , d} is minimal such that there 
is some <p ix G <p for which the leading term has degree j\/d. In this case 
the family 

^:={^-0 n : ie{l,2,...,k}\{h}} 

precedes <p . Indeed, we have removed one instance of degree j\ / d, and for 
every i ^ i\ we have that (p i — <p ix still has degree at least deg <p i > j\/d (the 
goodness assumption implies that all coefficients of <p i and <p { are distinct, 
so there can be no cancellation of coefficients here, and deg <p i > ji/d by 
the minimality of ji). Also, tp is still a good family since the collection of 
its coefficients is 

{Vij : i € {1, 2, . . . , fc} \ {h}} for j € {h + 1, . . . , d} 
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and 

{vjj-v^j-: i e {1,2, ...,k}\{h}} forj<ji, 
and it is clear that these are all still nonzero and linearly independent. 
We refer to a family ip constructed this way as a precedent of of type I. 

• On the other hand, if <\> i € is a top-degree member then the family 

{01, 2) • • • 3 0i-i3 0, r > 0i+i> • • • 3 0fc} 

has swapped out an entry of degree deg i and replaced it with an entry of 
degree deg i — 1 (where in case d = 1 we instead take the above to mean 
that 0j has been omitted altogether). So again it clearly precedes and is 
still good. We call this a precedent of of type II. 



3.4 The main induction 



In order to formulate an inductive hypothesis that includes Theorem 12.51 and can 
be closed on itself, we will actually prove a composite of three different properties 
of the multiple averages associated to each family . To this end we insert the 
statement of Theorem 12.5 1 as the second of the three related conclusions. 

Theorem 3.4 Suppose that (X, /x, r) is an MP -system and that = {0 1; 2 , . . . , 0^} 
is a good family of height-d /-polynomials M — > W D with expressions 

d 

0*w = E tiAV M' 

where each Vjj € Q D . Then the following hold: 

A( ): if (X, /i, r) is FRS and <p k E is of top-degree then the factor 

fc-1 

£ := ^Kv ft , d v V 
1=1 

i's partially characteristic for the averages 

A/(/i,/ 2) ...,/ fc ) : =/n^° rPi(t) d * 

1=1 

associated to 0; 
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B( <fi ): the averages Aj(f\, /2, . . . , fk) converge tempered-uniformly in L 2 {p) for 
any /i,/ 2 , ■ ■ ■ , fk G L°°{ji); 

C( 4> ): the Furstenberg self-joining fif d ^ ^ ^ exists and is invariant under the 
off-diagonal flow 

t i-> id x r* Vl ^ X T tV2 'J X • • • X T* V ^ 

/or each j < d. 

Clearly the assumption that v ? J € Q D involves no loss of generality in that we 
may simply change basis to make it true, but it does fix what we mean by 'FRS' in 
the statement of conclusion A( <p ), and for this it is important. 

In the above notation, for D € {A, B, C} we will write D(^( <p) for the assertion 
that D (lp) holds for all ip -< <fi. 

Our next task is to establish the base case of Theorem E 



Lemma 3.5 The conclusion A(</>)VB(0)vC(</>) holds if cp contains only one 
member, say (pi. 

Proof In this case we have 



with <pi(t) = Sj=i t^ d Vj- Clearly by omitting extraneous directions we may 
assume that R D = R d = V((px). Applying LemmaO in reverse, we see that it 
suffices instead to work with the polynomial Pi(t) = Ylj=i^ v j- However, for 
this we can simply regard the action r as a unitary flow U on the Hilbert space 
L 2 (/i) using the Koopman representation, so that the above average (with pi in 
place of cpi) becomes 

We can now show that this converges in norm to the conditional expectation (/ | CL D ) , 
which agrees with the orthogonal projection of / onto the subspace of [/-invariant 
functions in L 2 (/j,). This is most easily proved by appeal to the Spectral Theorem, 
which translates this assertion into the fact that for any finite measure 6 on R D 
(which arises as the spectral measure assocated to /) the functional averages 



X ^^x(PlO))ds 



17 



converge tempered-uniformly to the indicator function at the origin, l{o}> in L 2 (8). 
This in turn is clear using the Dominated Convergence Theorem, because these 
functions are all bounded by 1 and if x = (xi> X2, ■ ■ • > Xd) 7^ then the above 
average is 

r. Ct 

whose convergence to uniformly as \I\ — > oo (without any assumption of tem- 
peredness) follows by Weyl's Equidistribution Theorem (see, for example, Kuipers 
and Niederreiter ll24l ). 

This already contains the conclusion B(0), and it also contains A(0) because 
is contained in £ as a factor map. Finally, C(</>) follows by applying this 
convergence inside the definition 

Mf d ^(A x AO := Urn f M (A n r^VO) dt 

= [ l Ao ■ ( lim / l Al oT*i(*>d*W 

since this now gives that ^ is the relatively independent product of two copies 
of /i over the factor £J , which is invariant under everything. □ 

It remains to show the induction step. Three different arguments will be used to 
complete one iteration of this, and we present these separately. 

Proposition 3.6 If <fi contains at least two members then 



Proof This is the most difficult of the three parts of our inductive implications. It 
will rest mainly on a reduction from to a precedent of type I. 

Suppose that f\, f € L°°(fj,) are such that 

A/ n (/i,/2, • • • ,/fc) A 

in L 2 (/i) for some tempered sequence of intervals I n C [0, oo). We will show that 
in this case we must also have E^(/^ | £) ^ 0; from that point the proof is com- 
pleted simply by substituting the decomposition f k = E M (/ fe | £) + (f k - E M (/& | £)) 
for a general function fk and appealing to linearity in 
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Suppose that ^(t) = Y$=i t j/dv i,j for i = 1, 2, . . . , A; and that jx e{l,2,...,d} 

— > 

is minimal such that some element of <j> has degree jx / d; by re-ordering we may 
assume that it is 4> 1 . 

We begin by applying the version of the van der Corput estimate for continuous 
families of vectors (see, for instance, Potts' Appendix B in ETl ) to the functions 

k 

i=i 

to deduce from the non-convergence Aj n /2, • • • , /fc) A that we also have 

JO Jin JX i=1 i=1 

as n — > oo and then H — > oo. 
We may write the above integral as 

Jo .//„ Jx fJi 

= r H rr A^^n^-^rw^-j-^o^)^^ 

JO J/„ ~^ 

and now use the ^-preserving transformation r^ 1 w to change variables to obtain 
J" -f [ II (((/ i oA< 1+ ^-« 1 »r^).]g or^'l-^W) d^dtd/i. 

io .//„ i=1 

Now, since </>~(i + /i) — (j>~[{t) — > for any /i G R as i — » oo, by the strong 
continuity of r we also have 

f iOT <t>-{t+h)-<t>~(t) T hv lA _ f. OT hy i , d+ ^f. OT <i>T(t+h)-^{t)_f i j OT h^ i , d _^ f iOT hv ^ d 

in || • || 2 for any h as t — >■ oo (and hence for (1 — o(l))-proportion of the interval 
J n as n — > oo). Since all of these functions are also uniformly bounded in || • H^, 
replacing them one-by-one implies that as n — > oo the above integrals behave 
asymptotically the same as 

J" -f I IT(((^ or?iVl ' d )-^) or</>lW ^ lW ) d ^ dtd/l - 

JO Jin JX i=1 
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Now consider the family tp = {ip 2 i ^3) • • • > V'fe} defined by 

V> f (t) := 0i(t) - 

which is a type-I precedent of <^ . The inductive assumption of C( i/> ) implies that 
the above integrals converge as n — > oo to 

fe 

^((/^r^j.^d^ ^dfe, 

and that the Furstenberg self -joining ^ ^ ^ =: that appears here is 

invariant under each of the flows 

t h-> id x r *( v 2,i- v i^) x r^ V3 ^- V1 -^ x • • • x r *( v w -Vl ^ 

for j = 1,2,...,<£ 

We now invoke the sleight-of-hand that underlies the previous works H |2] [3j on 
nonconventional averages: we define a new jointly measurable action 
X k by setting 

ptvij ._ (t 1 *™) xk whenever i < k - 1 and / 

and 

CT tv M . = r tv w x r tv 2 ,, x . . . x r tv fclj for aU ^ < ^ 

(noting that v^j ^ for all j < d, by the assumptions that <p k is of top-degree and 
is good). There is no conflict in this definition, because by the goodness assumption 
on 4> all the non-zero vectors Vjj are linearly independent. 

In terms of a, the measure fi F -> is invariant under a tVt ^ for any i < k — 1 (since 

id, if> 

this is just an element of the diagonal action r xfc ), and also under each of the flows 
<7 tVfc >j for j < d, as a consequence of hypothesis C(ip) and the identities 

^tvfcj _ x T t(v 2 j-v hj ) x T t(v 3 ,j-vi,j) x . . . x r t(Vfe,j— Vlj)\ ocr *vij 

In particular, since /i F _a is <T* v * d -invariant, the usual Mean Ergodic Theorem im- 

id,V 

plies that that the averages over h G (0, H) of the functions 

(g) (/i o r h ^) = ( (g) /*) o o**m e £°°0< d ^) 

i=l i=l 
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converge in II • IU as H — > oo to some g € L°°(u F ->) that is cr Rvfcd -invariant. 

id,i/> 

Inserting this convergence into our integral averages, we deduce that they must 
tend to the limit 

k 



Xk i=l 

and therefore that this latter is not zero. 
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On the other hand, writing 7Tj : X k — > X for the i th coordinate-projection, we 
see that for each i = l,2,...,k — 1 and each j for which Vj j / 0, the function 

,F ): „ ^u-*™. 



/i o 7Tj G L°° (// F ) is manifestly invariant under the flow 



I | ^ r *(vij-Vi,j) x T-t^j-Vi.j) x . . . x r *(vfc,j— Vi,j) = g-Hykj-Vij) 

because the action of this flow on the i th coordinate is trivial. 

It follows that the R^-system (X k , /jF^ a) is an extension of (X, /j,, r) through 

the factor map TTk, and that among functions on this extended system we have 
that g is measurable with respect to d and that fi o m is measurable with 
respect to Cr( v . _ Vfe ) ^ or eacn i < k — 1 and j < d, and hence with respect to 
Cv(<p.-<p k )- Therefore the non-vanishing of the above integral tells us that the lifted 
function fk o -k^ on the extended system (X h ,ri F -*, a) has nonzero conditional 

idx,V> 

expectation onto the factor 



fc-i 



i=l 

Since (X, /j,, r) is sated for the idempotent class of all systems that can appear as 
such a factor (recall that one of the assumptions for assertion A( <f) ) is that (X, /j,, r) 
is FRS), it follows that in fact we must have E M (/fc | £) / 0, as required. □ 

Proposition 3.7 If <f> contains at least two members then 

A(^)VB(4)^B(^). 



Proof First note that if d'/d < 1 is the maximal degree of any member of 0, 
then performing the time-change s := t d l d defines a new family of f-polynomials 
<f>i(s) := cf) i {s d l d ) of height d' which has coefficient vectors precisely the same 
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as for , so it is still good. By Lemma [2721 it suffices to prove the desired conver- 
gence for the averages associated to these new f-polynomials but this is now an 
instance of B(-< 0) because d! < d and so this time-changed family precedes 0. 

Therefore we may assume that has some member of top-degree, say fc . In this 
case the result follows from Theorem 13 . 2 1 using a reduction to a precedent of type 

n. 

Next, it is clear that the averages Aj associated to the family converge for a 
system (X,fi,r) if they converge for some extension it : (X,jl,f) — > (X, /u,r) 
of that system, simply by lifting each function f\ to f\ o n, so in proving B(0) 
we may assume that (X, fj,, r) is FRS. Given this, A( ) tells us that it suffices to 
prove convergence when ff, is measurable with respect to the factor 

fc-i 

i=l 

This, in turn, implies that it can be approximated in || • \\% by a finite linear com- 
bination of functions of the form g ■ Y\j =2 hj with g invariant under T^ Rvfc d and 
each hi invariant under r^^* - ™, where t^ v denotes the restriction of r to an 
action of the subgroup V. A simple continuity argument now shows that it suffices 
to prove convergence when equals a finite linear combination of such products, 
and now by multilinearity it actually suffices to consider just one such product. 

However, having reduced to this case our averages enjoy the simplification 

fU(fior^)dt = fH(fiOT^)-((gh2h 3 ---h k )oT^)dt 
^ T i=i •' I i=l 

. fc-i 



i=i 



since g o r^*W = g o t^* (*) for all t, and 0j(i) - (f> k (t) € V{4>i - 4>k) and so 
hi o r**W = hiO r**W for all t and aU i < k - 1. 

If 0^" is nontrivial the convergence of these latter averages in L 2 (fx) follows from 
that for a system of averages associated to the f-polynomial family {cf> 1 , 2 , ■ ■ • , 4>k- 
This is a type-II precedent of , and so this convergence follows from the induc- 
tive assumption of B(^( 0). On the other hand, if 0^7 is trivial (which can occur 
only if all of the maps <f) i were actually linear), then we may simply move g out- 
side the above integral altogether and simply apply hypothesis B for the precedent 
family {0 1; 02,..., fc _ 1 }. □ 
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Proposition 3.8 If <fi has at least two members then 



Proof Once again we may assume that cj> has a member of top-degree, say 4> k , 
and in this case will use another reduction to a precedent of type II. 

The Furstenberg self-joining uF -* exists provided the averages 

id, cp 

-f T n(A n r _ *W(Ai) n T-^ t \A 2 ) n • • • n T-^ {t) {A k )) dt 
Jo 

converge as T — >• oo for all Borel subsets Aq, A\, A k C X, and this now 
follows at once from B( cf> ) by writing the above scalar- valued averages as 



/ l^o • A(p,T)0-AiAA 2 ,- ■ ■ , U fe )d/i. 
Jx 



It remains to prove the invariance of fi F -> under the asserted off-diagonal flows. If 

id, <p 

7T : (X, /i, f) — >■ (X, t) is any extension then clearly fi F is the pushforward 
of /i F -> under 7r x ( fc+1 ), and 7r x ( fc+1 ) also intertwines each transformation 

idx> <t> 

idx x f* Vl ' J x • • • x f tVk -J with idx x r iVl - J x • • • x T tVk > j , 

so by Theorem 13 .2 1 it will suffice to prove this invariance assuming that (X, /u, r) 
is FRS. 

Thus, suppose that /o, f\, f 2 , ■ ■■ , fk € L°°(ti), j < d and t G R. For each j < d 
we will show that 

/o ® /i ® /a <8> • • • ® /fc d// -> 

/o ® (/l o r' V1 -- ) <g> (/ 2 o r< v ^ ) ® • • • ® (f k o r iv ^ ) d/i F -> , 

from whence multilinearity and a simple approximation argument complete the 
proof. 

By the hypothesis A(^) we know that the factor 

fc-l 

i=i 
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is partially characteristic for the averages -A(o,t) associated to cf> . Hence the ex- 
pression for the above integral as a limit of integrals of fo • -A(o,t)(/i> f2, ■ ■ ■ , fk) 
gives that 



J X k + l ld > 



and similarly 

/o ® (/i o r tv ^) ® (/ 2 o r tv ^) ® ••• ® (/ fc o r tv ^)d// 



id,^ 

/0 ® (/l O T tVl - ) ® (/ 2 O T tV2 » ) ® ■ ■ ■ ® (E^M O T*^ ) d// -> 

(because £ is respected by r tVfc ^ for each j). Therefore it suffices to prove the 
desired equality under the assumption that /& is ^-measurable. 

However, with this in hand we may approximate / as g ■ Y\i=i hi in just the same 
was as for the proof of Proposition 13.71 and now rearranging according to the dif- 
ferent invariances of the functions g and hi gives 

fo ® fl ® /2 ® • • • ® {gh\h 2 ■ ■ ■ hk-i) dfi F -* 

lim / f ■ Ar 0T) (fi,f 2 ,...,gh 1 h 2 ---hk-i)dn 

lim / fo-A7 T Jf 1 h 1 ,f 2 h2,...,fk-ih k - 1 ,g)dn 

/o®(Mi)®(/a/Hj)®---®(/fc-i/ifc-i)®^d^ <6 A , 

Xk + l la i < Pl! < P2r--)<? > fe-l 

where we let ^4^, T \(f[,f 2 ,---,ff.) denote the multiple averages associated to the 
type-II precedent family 

{01,02) • • -,4>k}- 

(As in the previous proposition, we must tweak this step in case cj)^ is trivial, but 
then we simply let A^ QT ^ be associated to {<p\, 4>2> • • • i 0fc-i}' which is still a 
type-II precedent, and simply move g outside these averages altogether.) 

On the other hand, since hi o r tVk ^ = hi o t <Vi >j , an analogous calculation gives 
fo ® (/i ° r tV1 -) ® • • • ® (Q?^ • • • fc fc _i) o r /v ) ,!//'' 



id,0 



/ /o®((Ml)°r' Vl >0®- ' ■®((/fc-i/ifc-i)or tv «-w)®(P°T iVM )dMM < 4 s 

l X k+l ia ><Pl>'P2>- 
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The proof is finished by treating two separate cases. 

Case 1: j < d — 1 In this case the vectors Vjj for i < k are all still among 
the coefficient vectors of the precedent family {cj> 1 , 4> 2 , ■ ■ ■ , 4>^ }> an d so applying 
the inductive assumption of conclusion C for that family gives the desired equality 

fo ® (Mi) ® (Ma) ® ■■• ® (AMfc-i) togd/i* 



X k+i 



/o®((Mi)°t* V1 '' )®- • ^((/fc-ifefc-Oor^-^)®^^ ) d/£ 

Xfc+i "Wi><?>2> 

Case 2: j = d In this case we cannot make a simple inductive appeal to con- 
clusion C for the precedent family, because the vector v^ ^ is not among the co- 
efficient vectors for that family: it has been removed from (f>7. However, we also 
know that g is r^ Rvfc d -invariant, so in this case we write 



X k+i 



/(^((MiH^)®- . .®((/ fc _ 1 / lfc _ 1 )or tVk - 1 .« l )®( & or tVh .'') dfi 



and now we can apply the inductive assumption of condition C to the precedent 
family, since all the remaining vectors Vj^, i < k — 1, still appear as coefficients 
in that family. This gives the desired equality again, and so completes the proof. □ 

Completion of the proof of Theorem 13.41 Suppose for the sake of contradiction 
that there were some good family of f-polynomials q> for which A(0)VB(0)V 
G( (f> ) fails. Since -< satisfies the descending chain condition this implies that there 
is a minimal such family, and by Lemma [331 it must have at least two members. 
However, for this family we know that A(-< 0) V B(-< 0) V C(^ (f>) is true. 
Therefore Proposition 13.61 implies that A(0) is true; feeding this additional in- 
formation into Proposition 13 .7 1 gives that B(0) is true; and now feeding this into 
Propositino 13.81 gives that C(0) is true. This amounts to a contradiction, and so 
completes the proof. □ 



4 Further questions 

The ability to use fractional-power time-changes, 1 1— > t a , makes the study of poly- 
nomial multiple averages in higher-dimensional continuous flows much simpler 
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than its discrete counterpart. As remarked in the introduction, the older Conjec- 
ture 11.11 for polynomial multiple averages in a Z D -action remains open, and it 
seems that the methods of the present paper offer little new insight into it. 

However, within the setting of continuous actions there are other relatives of The- 
orem [L2]that may be of interest in their own right. 

First, it is natural to ask whether the convergence of Theorem l 1 . 2l holds for arbitrary 
real intervals of increasing length: that is, whether the averages . . . , /&) 

converge in L 2 (^l) as |J| — > oo, irrespective of how far the intervals / lie from 
the origin in R. This question already creates difficulties for the use of fractional- 
power time-changes, since the correct replacement for Lemma I2T21 is not so clear 
outside the regime of tempered sequences of intervals. 

A second question, posed by Vitaly Bergelson, asks whether the polynomial maps 
Pi appearing in that theorem could be replaced by members of some more general 
class. Natural candidates here are members of the Hardy field of logarithmico- 
exponential functions, provided we choose such functions that do not grow too fast 
or too slowly. More broadly still, it might be possible to single out a suitable class 
of maps R — > R D in terms of a Fejer-like condition on the asymptotic behaviour of 
their derivatives as t — > oo. In the setting of a single discrete transformation, some 
such extensions of the multiple recurrence and convergence theorems are known 
from the work of Bergelson and Knutson [7 ], who discuss the various growth con- 
ditions for these maps that are natural in this context. We will not stop to formulate 
a precise conjecture here, as handling such more general maps would require too 
much preparation, but refer the reader to that paper for an introduction that can 
easily be transferred into the continuous-time world. 

Another natural avenue for generalizing Theorem 11.2 l is to allow polynomial maps 
R r — >• R-° for some higher-dimensional domain space R r . In this case the formu- 
lation of our more general question is clear: 

Question 4.1 Is it true that for any action r : R~° rx (X, fi), any fi, fz, ■ ■ ■ , fk G 

L°°(fi) and any polynomial maps : R r — > MP for i = 1, 2, . . . , k that the 
associated multiple averages 



I have not tried to extend the methods of the present paper to handle this question, 




converge in L 2 



(fi) as R — > oo? 
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but would not be surprised if that were possible. Indeed, it should not be hard to 
extend the ideas of Section |2] to develop a notion of a 'tempered' F0lner sequence 
of domains F n C W , growing in size as n — > oo relatively fast compared to 
their distance from the origin, which is robust under a suitably rich collection of 
fractional-polynomial changes of variables 

(ti,t 2 , ■ ■ ■ ,t r ) !->• (t-L t 2 ■■■t k ,...,t 1 t 2 ■■■t k )■ 

However, even after introducing this flexibility, I do not at once see the right ex- 
tension of the precedence ordering of Subsection I3.3l for fractional-polynomials in 
several variables, and so a little more work may be needed to find the right induc- 
tion scheme if the above assertion to be proved this way. 

Lastly, a more ambitious generalization in a similar vein would ask instead about 
an action of a connected nilpotent Lie group r : N rx (X, /i) and polynomial 
maps W — > N. Starting with Leibman's work [25] on multiple recurrence for 
actions of discrete nilpotent groups, there has been some progress on extending 
multiple recurrence and convergence theorems to nilpotent actions, and it might be 
hoped that the study of polynomial averages in an M D -action serves as a natural 
step towards such averages for a connected nilpotent group. However, the greater 
complexity (albeit polynomial) of the multiplication in a nilpotent group makes it 
unclear whether the device of fractional-power time-changes yields similar simpli- 
fications as in the case of M. D -actions, and I suspect that a more serious departure 
from the arguments above would be needed to settle this question. 
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